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Abstract 

Overheads incurred by routing protocols diminish the capacity available for relaying useful data in a mobile wireless ad 
hoc network. Discovering lower bounds on the amount of protocol overhead incurred for routing data packets is important for 
the development of efficient routing protocols, and for characterizing the actual (effective) capacity available for network users. 
This paper presents an information-theoretic framework for characterizing the minimum routing overheads of geographic routing 
in a network with mobile nodes, specifically, the minimum overhead problem is formulated as a rate-distortion problem. The 
formulation may be applied to networks with arbitrary traffic arrival and location service schemes. Lower bounds are derived for 
the minimum overheads incurred for maintaining the location of destination nodes and consistent neighborhood information in 
terms of node mobility and packet arrival process. This leads to a characterization of the deficit caused by the routing overheads 
on the overall transport capacity. 
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I. Introduction 

J> ' "M JT OBILE ad hoc networks are characterized by dynamically changing network topology which makes routing packets 
1 t M in an ad hoc network a very challenging problem. Routing protocols either fail to cope with the changing topology 
f^) ■ and yield low packet delivery rate or incur very high overhead. It is important to understand the minimum overhead incurred 
£^ ■ for routing packets with certain level of reliability. Knowledge of such a fundamental overhead limit would not only allow 
f**«« . researchers to know how much a protocol deviates from the theoretical minimum, but also inspire the development of routing 
, protocols that achieve the limit. 

We view that the primary goal of a routing protocol is to gather and disseminate state information such that a node may 
^ | take packet forwarding decisions that satisfy certain performance criteria. The state information may be comprised of link 
' states, node locations, velocity and direction of nodes, queue lengths, etc. The performance criteria could be minimum delay, 
j_i ■ maximum throughput, maximum lifetime, delivering certain fraction of packets or simply best effort delivery of packets. 



It is difficult to compare the overhead of various routing protocols since as yet there exist no absolute bounds on the 
minimum overhead incurred by the well known protocol classes. The premise of this work (as well as related prior work [1]) is 
that, instead of asking "what is the best routing protocol?" we could characterize the routing overheads incurred by a few key 
classes of protocols. The work in [1] addressed the overheads for proactive routing protocols. This paper considers geographic 
routing, which has become very commonly used as a network layer paradigm for multi-hop wireless networks. 

In this paper we present an information-theoretic framework for characterizing the minimum overhead incurred by geo- 
graphical routing protocols. Information theory provided us with lower bounds on the minimum number of bits required to 
encode a source. Thus, it is reasonable to expect that information theory would be a suitable tool for developing lower bounds 
on the amount of overhead incurred by routing protocols for disseminating and gathering state information in a mobile ad hoc 
network. 

A preliminary version of this paper has been accepted to be published in the proceedings of 26 th Annual IEEE Conference on Computer Communications 
(IEEE INFOCOM'07). 
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In geographic routing each node maintains its location information at one or more location servers (e.g. see [2], [3]). When 
a source wants to forward a packet to a destination, it queries an appropriate location server for the location of the destination. 
The location server replies to the source node with the available location information. Thereafter, the source and intermediate 
nodes forward the packet according to the location of the destination. It is pointed out in [4] that the fraction of packets 
delivered by geographical routing varies inversely with the average error in location information stored at the location servers. 
Thus, maintaining packet delivery ratio above a given threshold corresponds to maintaining location errors below a certain 
threshold. 

We categorize geographic routing overheads into two categories: (i) Location update overhead: The overhead incurred in 
updating the location servers such that the location errors in the reply to location queries is less than e, and (ii) Beacon 
overhead: The overhead incurred in beacon transmission such that the probability that a node has consistent neighborhood 
information when it needs to forward a packet is greater than 1 — 5. We formulate the problems of finding the minimum 
values of the above-mentioned overheads as rate-distortion problems. For location update overheads, the distortion measure 
used is squared error in the location information stored at the location servers (squared error distortion measure). For beacon 
overheads, the distortion measure is the probability that a perceived neighbor is not an actual neighbor (Hamming distortion 
measure). Using a rate-distortion formulation, we present lower bounds on the minimum geographic routing overhead incurred 
in terms of node mobility, packet arrival process, and reliability criteria e and 6. First, we consider one-dimensional network 
case and then extend the results to two-dimensional networks. 

We then unite the results obtained here on the minimum geographic routing overheads with the results on the transport 
capacity of stationary multihop wireless networks evaluated in [5], in order to characterize the effective capacity available for 
users. It is observed that when the node mobility is high and the average packet inter-arrival time is sufficiently small, the 
complete transport capacity of an ad hoc network may be consumed by routing overheads. We derive an upper bound on the 
critical network size above which all the transport capacity of the network would be consumed by the routing overheads and 
no useful communication would be possible. In this paper we only consider the scenario where the routing protocol initiates 
the forwarding process as soon as a packet is generated at the source. Thus the potential capacity improvement due to node 
mobility (achieved at the cost of delay associated with waiting for the destination to move to a nearby location) pointed out 
in [6], [7] is not applicable to our work. 

A. Main Contributions 

The main contributions of this paper may be summarized as follows: 

1) We present a new information-theoretic formulation for evaluating the minimum routing overhead incurred by geographic 
routing. The formulation is general so that it may be applied to any node distribution, packet arrival process, and may 
be extended to any location service scheme and mobility model. 

2) For Brownian mobility model and various packet inter-arrival time distributions, we evaluate lower bounds on the 
minimum rate at which a node must transmit its location information and beacons such that the packets are routed 
with desired level of reliability. Combining both overheads, we find a lower bound on the capacity deficit caused by 
geographic routing overheads in mobile wireless ad hoc networks. 

3) We characterize the effective transport capacity of an ad hoc network after taking into account the minimum routing 
overheads that must be incurred for reliable geographic routing. 

4) For a given packet arrival process, standard deviation of Brownian motion and reliability parameters (e, 5), we evaluate 
the upper bound on the number of nodes the ad hoc network can support such that the complete transport capacity of 
the network is not used up by routing overheads. 

B. Paper Outline 

The rest of the paper is organized as follows. A brief overview of related work is presented in Section[II] The network model 
is presented in Section [Til] The rate-distortion formulation and evaluation of a lower bound on the minimum position update 
and beacon overheads are presented in Sections [IV] and [V] respectively. A discussion of the capacity deficit caused by routing 
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overheads is presented in Section |VT] The application of the formulation to other scenarios and some possible extensions of 
the network model is discussed in Section IVHI We present conclusions and directions for future research in Section IVIIII 

II. Related Work 

So far, we believe information theory has not significantly influenced the design and understanding of communication network 
protocols (an opinion we share with the authors of [8]). One of the earliest (and most significant) attempts in using information 
theory to enhance the understanding of communication networks was made in [9]. Gallager [9] used an information-theoretic 
approach in order to characterize a lower bound on the amount of protocol information required to keep track of the sender, 
receiver and timing of messages for a simple (stationary) network model. It is found that although the introduction of message 
delay decreases the protocol information, small average message length and high message arrival rate may lead to prohibitively 
high protocol overhead. 

A few relatively recent papers have used information theory to understand the effects of node mobility on wireless networks. 
An analytical framework, based on entropy of node location, for characterizing delay and overhead associated with paging and 
routing a call to a mobile station in a cellular environment is provided in [10]. The complexity of tracking a mobile user in a 
cellular environment is studied using a information-theoretic approach and a position update and paging scheme is proposed 
in [11]. An entropy based modeling framework for evaluating and supporting route stability in mobile ad hoc networks is 
proposed in [12]. In [13], the authors propose the entropy of link change as the metric for mobility models against which 
performance of wireless network protocols could be evaluated. 

The overhead incurred by routing protocols and their scalability properties has been studied in [1], [14]-[23]. The initial 
studies [14], [15], [17], [18] are mainly simulation based. These studies point out that none of the routing protocols performs 
well across all scenarios. Instead each protocol performs well in some scenarios and bad in others. 

Although simulation-based studies provide useful information about the performance of routing protocols, the observations 
may not be generalized to all scenarios. Therefore it is important to have analytical results for the performance of routing 
protocols in order to be able to develop a deep and general understanding of the trade-offs involved. In [16], [20], the authors 
present an analytical framework for characterizing the routing overhead for ad hoc routing protocols. Asymptotic results are 
provided for the overhead of proactive and reactive protocols in terms of network and routing protocol parameters such as 
packet arrival rate, hello packet transmission rate, hello packet size, size of route request packet, topology broadcast rate etc. 
The overheads of specific routing protocols were modeled in [19]. The impact of the routing-layer traffic patterns (in terms 
of number of hops between source-destination pairs) on the scalability of reactive routing protocols in ad hoc networks with 
unreliable but stationary nodes is studied in [22], [23]. It is found that the reactive routing protocols may scale infinitely (i.e. 
the routing overhead does not tend to infinity) with respect to network size for some traffic patterns. The analysis is extended 
to and similar results are obtained for cluster-based routing algorithms in [21]. 

Our work is along the lines of [1] where the authors use an information-theoretic approach to characterize the minimum 
routing overhead and memory requirements of topology-based (proactive) hierarchical routing protocols for ad hoc networks. 
The entropy of ad hoc network topologies as well as the entropy rate are used in [1] to find the above mentioned bounds. 
However, here, the family of routing protocols considered is geographic routing protocols and the performance constraints 
are also taken into account. This leads to a new problem formulation as rate distortion which was not considered in earlier 
work and new results on the effect on transport capacity. Some preliminary results of our work appeared in [24]. Several 
significant contributions are made by this paper in addition to [24]. In this paper we analyze beacon overhead for various 
packet arrival processes, evaluate tighter lower bounds on location update overhead, extend the overhead analysis to other 
scenarios of geographic routing, and include new results of capacity deficit obtained through numerical analysis. 

III. Network Model 

The network consists of n mobile nodes. The nodes perform Brownian motion with variance a 2 . We consider two kinds of 
network deployments: (i) One dimensional case: nodes located along a circle of perimeter L, and (ii) Two dimensional case: 
nodes located over a torus of surface area A. The central and lateral radii of the torus are denoted by R c and Ri respectively. 
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The closed curve and surface are chosen for the study, instead of a finite line or a rectangle, in order to avoid the complexity 
of modeling the behavior of Brownian motion at boundary points. 

We assume that L » a 1 and R c ,Ri >> <J 2 ■ The large dimensions ensure that the nodes do not wrap around the curve 
or surface during small intervals of time. So if we look at the motion of a node during a small interval of time, then with 
probability almost one the motion is similar to Brownian motion on an infinite line or plane with the initial node position as the 
origin. Thus, in the rest of the paper, we treat the motion of nodes during the time scale corresponding to packet inter-arrival 
times as motion on a plane or straight line. Over time the nodes do not drift apart from each other, as they would on a infinite 
line or plane, but just keep moving around on the circle or the torus. 

Conversely, this may be viewed as if we are observing the Brownian motion of the nodes on an infinite line or plane and 
mapping their positions back on the circle or torus, respectively. For example, consider a node that performs Brownian motion 
along the x-axis and whose initial position is the origin. At time t, suppose the node is located at X(t), Then it may be mapped 
to a point mod away from the initial position of the node on the circle, with distance measured in counter-clockwise 

direction. Similar mapping is possible in the case of torus by considering an infinite plane. Thus instead of keeping track of the 
positions of nodes on the circle or torus, we use the coordinates of the nodes on x-axis and infinite plane. This scheme works 
since we are only interested in the change in positions of nodes during packet inter-arrival periods. The coordinates of nodes 
are denoted by Xi{t). Hence Xi(t) = {Xn(t)} and Xi(t) = {Xn(t), Xi2(t)} for one and two-dimension case respectively. 
The location information of node i available at the location server at time t is denoted by Xi(t), hence, Xi{t) = {Xn(t)} and 
Xi(t) = {Xn(t), Xi2(t)} for one and two-dimension case respectively. 

The j th packet destined to destination i generated at a node (source of the j th packet) in the network at time Ti(j), V 
j > 1. Define X^O) = V 1 < i < n. For all j > 1, let Sj = Tj(j) - T^j - 1) denote the packet inter-arrival time which 
is independently and identically distributed (i.i.d.) with pdf fs(t), such that /s(i) — Vt < and E[S] exists. Similarly let 
Ti(k) denote the time at which the k th packet is forwarded by node i, with Ti(0) = 0. The forwarded packets include both the 
packets generated by node i and the packets for which the node acts as an intermediate relaying node. The inter-arrival time 
of the forwarded packets, n(k + 1) — Ti(k) V k > 0, is an i.i.d random variable whose pdf denoted by f T (t). 

We assume a GHLS [25] like location service scheme. For each node, a hashing function is used to map the node id to 
a home region within the deployment area. A node within the home region acts as the location server for the node. A node 
sends updates its location information by sending update packets destined to its home region. The packets are forwarded using 
geographic routing. The location server, located within the home region, receives these update packets and maintains the location 
information of the node. Since the nodes are mobile, the members of home region change over time. A lightweight hand-off 
mechanism, proposed in [25], may be used to hand-over the task of maintaining location information to new nodes when 
a location server leaves the home region. The hand-off mechanism does not require transmission of extra control messages. 
The current location server uses only the beacons received from neighbors in order to ascertain if it needs to hand-over the 
responsibility of maintaining the location information. We discuss extensions of our work to other location service scheme 
models in Section [VTll 

The communication radius of each node is r meters. When a new packet generated at a source node, it queries the location 
server of the packet destination for the location of the destination. The packet is routed to the destination according to greedy 
geographic forwarding using the destination location information returned by the location server. It is assumed the position of 
a destination does not change significantly while the location server is being queried by the source and the packet is being 
forwarded through the network. In other words the time scale of forwarding a packet is much smaller than that required for 
a significant change in position. Also the network is assumed to be always connected such that nodes can communicate with 
the desired location servers. 

IV. Location Update Overhead 

In this section we evaluate a lower bound on the minimum rate at which a node must transmit its location information such 
that the average error in its location stored at the location server is less than e whenever the server is queried. We first introduce 
the notation and rate-distortion formulation, followed by analysis for one-dimensional and two-dimensional networks. We also 
evaluate lower bounds for deterministic, uniform and exponential packet arrival processes. 
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A. Notation and Rate-Distortion Formulation 

Definition 1: Di(t) is the squared-error in the location information of destination i available at its location server at time t, 
i.e., 

D i (t) = \X i (t)-X i (t)\ 2 (1) 



where \X t (t) X t (t)\ = (x^t) X tJ (t)) . 

Definition 2: X^ — {Xj(Ti), Xj(Ti), . . . , Xj(Tjv)} is the vector of locations of destination i at time instances Tj, 1 < j < 
N. Similarly X-* = {Xi(Ti), Xi(T 2 ), . . . , X,(Tjv)} is the vector of location information at the location server of destination 
i at time instances Tj, 1 < j < N. 

Definition 3: Xf and Xf are sets of all possible vectors X-* and Xf , respectively. 

Definition 4: P N [xf;xf] denotes the probability that X t N = xf and X? = xf, where xf £ X t N and xf e X t N . 
Definition 5: DiN is defined as 

1 N 

where i?[Di(Tj)] is given by 

£?[A(T J -)]= E E ^^f^flAffi) (3) 

Definition 6: Vn(^ 2 ) is defined as the family of probability distribution functions Pn[x^ ;xf] for which Dj/v < e 2 - 
Definition 7: i?/v(e 2 ) is defined as the iV th -order rate-distortion function - the minimum rate at which a destination must 
transmit the location information such that the DiN < e. According to [26], i?jv(e 2 ) is given by 

^ (e2) = P,fAV)^- (xf;lf) (4) 

where Ip N (Xf*; X^) is the mutual information between Xf and Xf . 

The minimum rate at which a destination must update its location information such that a large fraction of packets are 
delivered, represented by i?(e 2 ), is given by 

R(e 2 ) = lim mmR N {e 2 ) (5) 

N^oo 

B. Location Overhead for One-Dimensional Networks 

Lemma 1: The mutual information between X? 1 and X?* satisfies the following relationship 

inf 7 Pjv (Xf;Xf)> NR^e 2 ) (6) 

Proof: This proof is very similar to the proof of [9, Theorem 3]. 
Using the standard definition of mutual information we get 

I Pn {X? ; X") = H{X») H{X»\X») (7) 

Now consider H(X? |Xf ) 

TV 

HiXflX?) = HiXaWlX") -J2 H ( x n ( T i )\X ll {T 1 ),X ll (T 1 ), . . ..la^^f) 
Since conditioning cannot increase the entropy, we have 

N 

H{X?\X?) < HiXaiT^X^ + ^mniT^XaiTj-^XaiTj)) (8) 

1=2 
N 

= mXa^X? ) + Y,H(X il (T j ) -XaiTj-^XaiTj^Xapj)) (9) 

J=2 
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(O follows from ([8j since H(X i i(Tj)\X i x(Tj^i)X i i(Tj)) is already conditioned on Xu(Tj-i), subtracting it from Xn(Tj) 
is similar to translating the random variable by a scalar. Define a new random variable, Yj (1 < j < N — 1), such that 

Yf = XatTj) - Xnpj-t) (10) 

Since Yj depends only on Xn(Tj) and Xji(Tj_i), we have 

H(Xn(Tj) — Xii(Tj-i)\Xn(Tj-i),Xn(Tj)) = HiX^Tj) - Xa^T^Y^X^T^X^Tj)) (11) 

< H(Xn(Tj) — Xn(Tj-x)\Yj) (12) 

Equation (fT2l follows from ( fTTT l because conditioning does not increase entropy. Thus we get the following upper bound on 

ff(Xf|Xf) 

JV 

H(X^\X t N ) < HiXai^XaiT^+^HiXaiT^-XaiT^Yj). (13) 

i=2 

Now consider H(X^), since XfiCZj) — Xji(Tj_i) are independent of each other, we have 

JV 

H(X?) = H(X il {T 1 ))+Y,H(X il {T j )-X a (T j - 1 )) (14) 
Combining (0, ( fT3l and (TBI , we get 

JV 

I PN (X t N ;X t N ) = /(^(TOii^T^+^/^tT,) - X,,(7} t):^) (15) 

i=2 

Notice that, the squared error in the location information may also be written as 

D i( T j) = \*j - (Xii(Tj) - XaiTj-x))] 2 = \X t {T 3 ) - Xi(Tj)\ 2 (16) 
Let dj — E[Di(Tj)}. Since Xn(Ti) has the same distribution as Xn(Tj) — Xn(Tj-i), and ( TTBT l is satisfied, therefore we have 

I(I(X a (T 3 ) - X u (T :j ^Yj) > R^dj) V j > 2 (17) 

Define d % = £[|X a (Ti) - l 4 i(Ti)| 2 ], then by substituting (fTTb in (1X31 and using the convexity of the rate distortion function 
R\, we get 



JV ( JV \ 

i=i \ j'=i / 



(18) 



Now since P N £ V N , (1/iV) £^Li ^ = D iJV < e 2 , therefore from d we have I Pn (Xf ; ) > NRi{e 2 ). ■ 
^From Lemma Q] and the definition of rate distortion function and <(5j it follows that 

i?(e 2 ) > i?i(e 2 ) (19) 

The following theorem provides a lower bound on the minimum rate at which a destination must update its location information. 
Theorem 1: The lower bound on the location update rate (in bits per packet) is given by 

i?(e 2 ) > hiXaiTx)) - ilog27ree 2 (20) 

where h(Xn{T\)) is the differential entropy of the location of destination i at the time when the first packet destined to it is 
generated in the network. 

Proof: ^From ( fT9l we know that the minimum update rate is bounded by i?i(e 2 ), which in turn is defined as 

Ri(e 2 )= inf Jpj (Xii (Ti );Xn(Ti)) 

Pi £Pi 
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Now consider I Pl (Xn (Ti);Xn (Tx)), 

Ip^XaiT^XaiTx)) = /i(JQi(Ti)) — h(Xix(Tx)\Xix(Tx)) 

= hiXaiTx)) - hiXaiT,) - XaimXaiTx)) (21) 

> hiXaiTj)) - hiXapt) - XaiTi)) (22) 

> /i(X a (Ti)) - ft(JT i i(A/'(0,£7[(X i i(Ti) - X a (Ti)) 2 ])) (23) 

> ^(X il (r 1 ))-ilog(27ree 2 ) (24) 

Here d22l follows from (fJTJ since conditioning does not increase entropy, d23b follows from d22l since for a fixed variance, 
the normal distribution has the highest differential entropy, and (t24b follows from d23l since for Pi € Pi, E[(Xn(Ti) — 
^i(Ti)) 2 ] < e 2 . Thus, 

i?i(e 2 ) > h(X a (Tx)) - X - log (27ree 2 ) (25) 

and d20b follows directly from it. ■ 
It should be noted that in under some situation h(Xix(Tx)) may be less than | log (27ree 2 ). Such situations may occur when 
a 2 and/or E[S] is small. Under these circumstances, the change in position of node between two packet generation instances 
may be comparable to the fidelity criterion (e 2 ) and hence small number of bits, if any, may be required to represent the 
change in position of a node between packet generation instances. However, over the course of time the position of a node 
may change appreciably which may require it to update its location information. When h(Xix(Tx)) < \ log (27ree 2 ), the right 
hand side of of inequality < f20b is meaningless. A more appropriate inequality will be 



R(e 2 ) > max \h(X a (Tx)) - - log (27ree 2 ) , Oj (26) 

We discuss the case where h(Xn(Tx)) < \ log (2-Kee 2 ) in detail in Section [VIT1 

Theorem [T] implies that the minimum update rate largely depends on h(Xn(Tx)), which in turn depends on two factors: 
(i) the mobility pattern of the destination node and (ii) the packet inter-arrival process. Let fxi{%) denote the pdf of Xix(Tx) 
(without loss of generality, JQi(To) = 0). For Brownian motion with variance a 2 and packet inter-arrival time distribution 

fs(t), fxi(x) is given by 

r°° i 2 

f Xl (x)= = e-^f s (r)dT (27) 

Jt=0 V27T(T Z T 



h{Xix(Tx)) = - f Xl (x)log(f Xl (x))dx (28) 



and h(Xix(Tx)) is given by 



The lower bound on the minimum overhead incurred by location update information in bits/second, denoted by U(e 2 ), is 
given by 

U(e 2 ) > max (h(X a (Tx)) - \ log (27ree 2 ) , (29) 

In the remaining part of this subsection we evaluate f Xl (x) and R(e 2 ) for deterministic, uniform and exponential inter-arrival 
time distributions. 

1) Deterministic inter-arrival distribution: We first consider the case of deterministic arrival case because it is easy to obtain 
closed form results for this case and therefore we can focus on gaining insights without bothering much about complicated 
analysis. 

Suppose that packet destined to destination i is generated in the network after every T seconds, that is 

fs(t) = 6(t - T) 

For such an arrival process, f Xl (x) is given by 

fx-, (x) = c^"' a t 



SUBMITTED TO IEEE TRANSACTIONS ON INFORMATION THEORY, JANUARY 2007 



8 



and h(Xn(Ti)) is given by 



/itMTi)) = ^ log (27r e( T 2 r) 




(30) 



For the deterministic inter-arrival time the lower bound is similar to the minimum number of bits required to represent a 
Gaussian random variable with variance a 2 T subject to the constraint that the expected squared-error is less than e 2 . This 
is because the change in position of a node during a packet inter-arrival interval is indeed a Gaussian random variable with 
variance a 2 T . 

The equation (f3Qb bounds the minimum number of bits that a destination node must send to the location server for each 
packet destined to it that is generated in the network. For the deterministic arrival, the the packet arrival rate equals 1/T 
packets per second, thus minimum update rate in bits/second, represented by U(e 2 ), is given by 



Notice that R(e 2 ) increases with both a 2 and T. This is because larger a 2 and T would imply larger uncertainty in the change 
in position of the destination during the packet arrival duration and hence more bits are required to represent the destination's 
position for each new packet. U(e 2 ) also increases with a 2 for the same reason. However, U(e 2 ) decreases with increase in 
T. This is because the increase in update rate due to higher uncertainty associated with high T is over-compensated by the 
fact that larger T implies that updates have to be made less often. 

For deterministic inter-arrival time it is also quite easy to construct a strategy to update the location server that achieves 
the bound in PTT l. The following strategy achieves the bound: Each destination updates its location at time kT (k = 0, 1, . . .) 
seconds by encoding Af(0, a 2 T) Gaussian random variable corresponding to the change of position since last update such that 
the squared error distortion is less than e 2 . However in a real-life scenario the arrival times of packets destined to a particular 
destination is not known a priori, although we might have some estimate of the arrival rate. 

2) Uniform inter-arrival time distribution: Among all continuous time distributions with a given finite base and mean the 
uniform distribution has the highest entropy. Thus uniform distribution maximizes the uncertainty of packet arrival instances 
and thus would lead to maximum position update rate among all distributions with the same finite base and mean. 

Consider inter-arrival time to be uniformly distributed between [0, T] such that the inter-arrival time distribution is given by 




(31) 




(32) 



From d27l >, fx x (x) is given by 




(33) 



Let hjj denote the differential entropy of fx t (x), then the update rate per packet is given by 




(34) 



and the update rate in bits per second is given by 




(35) 
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3) Exponential inter-arrival time distribution: The motivation for considering the exponential distribution is that, among all 
the continuous time distribution with base [0, oo) and a given mean, the exponential distribution has the highest entropy. Also, 
the exponential distribution is widely used to model external inter-arrival time of packets to a given destination. 

We consider an exponential distribution with mean 1 /a, given by 

f s (t) = ae~ at 

^From to d27l ). f Xl (x) is then given by 

f°° 1 x 2 

-^- aT dr (36) 



/O V27T(T 2 T 

Unfortunately a closed-form expression for the above integral cannot be evaluated. So we will use numerical methods to 
calculate its value. 

Let He be the differential entropy of the distribution f Xl (x) given in d36l ). Then the lower bounds on update rate per packet 
and update rate per second are given by 

R(e 2 ) > max \h E ~ ^ log (27ree 2 ) , 0^ bits/packet (37) 
U(e 2 ) > a max (h B - ^ log (27ree 2 ) , 0^ bits/second (38) 

C. Location Overhead for Two-Dimensional Networks 

In this section we present the update rate analysis for two-dimensional networks, which is based on the analysis for one- 
dimensional case. We also evaluate the lower bound for various packet arrival processes and discuss the effect of arrival 
processes on the minimum update rate. 

Brownian motion in two-dimensional space may be decomposed into two independent one dimensional Brownian motions 
along x and y coordinates each with a variance er 2 /2. Thus if Xi(t) = {Xn(t), Xi2(t)} denote the coordinates of destination 
i at time t then the distribution of Xn(t) is independent of the distribution of Xi2(t). The following Lemma expresses R (e 2 ) 
in terms of components corresponding to the two coordinates. 

Lemma 2: For two-dimensional networks, the rate distortion R (e 2 ) function may be written as 

i?(e 2 ) = min i?« (k 2 ) + R ( V ( e 2 - k 2 ) (39) 

0<fc<£ 

where 

J?«(e 2 )= lim inf 1j Pb (4^) (40) 

N-*oo P N £V N (k 2 ) Jy 

R( 2 \e 2 )= lim iif (41) 

AT->oo P N ePjv(e 2 -fc 2 ) JV 



Proof: Recall the rate distortion function is 

R(e 2 )= lim inf l/ Pjv (xf;Xf) 

TV— oo P N £V N (e 2 ) N 
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Now consider Ip N (Xj v ; X^). For two dimensional networks, this may be written as 

T fvN. yN\ t ivN V N. yN\ 

-<-P]v( A i ) - 1 PN( Ji ll7 Ji i2, A l ) 



!pn i x a I X i ) + i Pn ( x a ; x i I x a ) 
i Pn PQi i X i ) + i Pn { X ii'i X i ) 

^Pw(-^aj-^il) + ^Pn^H' X i2\ X il) +IPrt{Xi2',X-2) + Ip N (X^;X^_\X^) 



IPn ( X il > X il) + I Pn ( X i2 > X i2 ) 



(42) 



where = {XaiT^Xain), . . . ,X a (T N )}, X" = {X a (T{), X l2 (T 2 ), . . .,X i2 {T N )}, Xg = {^(Ti), X a (T 2 ), . . .,X a (T N )} 
and Xg = {Xa{Ti),Xa(T 2 ), . . . , X l2 (T N )}. 
We know that DiN < ( 2 implies 



1 N 1 N 

- [(Xa(Ts) - tuft)) 2 ] [(X l2 (T 3 ) - X l2 (T ] )) 



3 = 1 



The distortion constraint is satisfied if ± J2jLi E i x a{ T j) ~ x n{Tj)) 2 < k 2 and ^ J2j=i E (X i2 (Tj) - X l2 (T 3 )f 
e 2 — k 2 . Combining this and (1421 1. we get 



< 



1 



0<k<e N— >oo PjvePjvCfc ) iV 

which leads to d39b . 

/From Lemma Q] and Theorem Q] it follows that 



inf 



1 



P N eT'jv(e 2 -fc 2 ) N 



(43) 



i?W(fc 2 ) > hiXain)) - log (27refc 2 ) 
i? (2) (e 2 _ fc2) > ^^(Ti)) - log (27re(e 2 - fc 2 )) 



(44) 
(45) 



/From d44b and d45l ). it is clear that the right hand side of d39b is minimized for k 2 = e 2 /2. This leads to the following 
theorem. 

Theorem 2: In order to ensure that the average error in location information used for forwarding packets is less than e, the 
lower bound on the location update rate (in bits per packet) for a two-dimensional network is given by 



R{e 2 ) > max (h(X tl (Ti)) + h(X i2 (T x )) - log (Tree 2 ) , 0) bits/packet 
and the overhead incurred in bits/sec (U(e)) is given by 



U(e 2 ) > 



1 



(/ipQi(Ti)) + h{X i2 {T x )) - log (Tree 2 ) , 0) bits/ 



sec 



(46) 



(47) 



We now derive the lower bounds for deterministic, uniformly distributed and exponentially distributed inter-arrival times. 
1) Deterministic packet arrival: For the deterministic packet arrival process, where packets arrive at t = kT, k = 1, 2, . . . oo, 
the probability distribution functions of Xn(T\) and Xi 2 (Ti) are given by 

1 



fx 1 (x) = fx 2 (x) 



V na 2 T 



e <* 2t 



(48) 



and h(Xu(Ti)) and h{X i2 {T x )) is given by 

h(X ix {T x )) = h{X l2 {T 1 )) = ~ log (nea 2 T) 

Thus the lower bound on location update rate in bits/packet and bits/second is given by 

'a 2 T\ \ 

, bits/packet 



R(e ) > max log 



U(e z ) > - ( log 



a T 



, ) bits/second 



(49) 
(50) 
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Variance of Brownian Motion (a 2 ) 



Fig. 1 . Update rate vs variance of Brownian motion. 



The behavior of R(e 2 ) and U(e 2 ) is similar to that observed in the one-dimensional case. In fact the minimum rate is simply 
the double of what was observed in the one dimensional case. This is not surprising since the change in x-coordinate of the 
destination node is independent of the change in the y-coordinate of the destination. The change has variance tr 2 /2 rather 
than a 2 as in the case of one-dimensional motion thus one might expect the minimum rate in 2-D to be less than 2 times 
the minimum rate in 1-D case. However this decrease in entropy due to decreased variance is compensated by the fact that 
allowable squared error in both the coordinates is also decreased, leading to a factor of 2. 

2) Uniform distribution of packet inter-arrival time: For the uniform packet arrival process described in d32l . the probability 
distribution functions of Xn(Ti) and X^iTt) are given by 



fxd*) = /*.(*) = \l-^r e ~^ + ^ erf (-rm) ~ ^ (51) 
V ira 1 a L \\Ja 2 T) o r 2 

Let hjj denote the differential entropy of Xn(Ti) and X^Tx) i.e. hjj = h(Xn(Ti)) — h(Xa(Tx)), then the lower bound on 
update rate is given by 

R(e 2 ) > max (2h v - log (Tree 2 ) , 0) bits/packet (52) 
2 

U{e 2 ) > — max [2h v - log (Tree 2 ) , 0) bits/second (53) 

3) Exponential distribution of packet inter-arrival time: For the exponential packet arrival process, the probability distribution 
functions of Xn(Ti) and Xi2(7i) are given by 

fxM = /x a (x) = £ -^=e-(^ +aT )dr (54) 

Let He denote the differential entropy of Xn(Tx) and Xa{Tx) i.e. He — h(Xn(Ti)) — h{Xii{Ti)), then the lower bound on 
update rate is given by 

R(e 2 ) > max (2h B - log (Tree 2 ) , 0) bits/packet (55) 
U (e 2 ) > a max (2Jie — log (nee 2 ) , 0) bits/second (56) 

4) Comparison of update rates for various inter-arrival processes: Figures Q] and [2] show the plot of the lower bound on 
U(e ) against a 2 and E[S]. It is observed that for high a 2 and low E[S], the rate at which a source must update its location 
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Mean Packet Inter-Arrival Time 



Fig. 2. Update rate vs. mean packet inter-arrival time. 

servers becomes very high. Also it is observed that the rate required for deterministic packet arrival is higher than that required 
for uniform and exponential arrival processes. In fact the update rate for deterministic packet arrival process is higher than any 
other packet arrival process with the same mean inter-arrival time. Consider a packet arrival process with pdf f$ (t) and mean 
E[S], then Var (Xn(Ti)) and Var {X i2 {T x )) are given by 

r r #-±=e-&6xf a {t)*t = r ^fs(t)dt = (57) 
Jo J-oo V iter l Jo L L 

This implies that notwithstanding the packet arrival process, the variance of the change in location between two packet 

arrival instances depends only on a and E[S]. For the deterministic packet arrival process, Xn(Ti) and Xi%(Ti) are Gaussian 

random variables d48l . Since among random variables with the same variance Gaussian random variable has the highest entropy, 

deterministic packet arrival leads to the highest update rate. 

V. Beacon Overhead 

In this section we evaluate a lower bound on the minimum rate at which the nodes need to transmit beacons so that 
neighbors maintain a consistent neighborhood information. We then bound overhead in terms of bits per second. We first 
consider Brownian motion in one dimension and later extend the results to two dimension case. 

A. Notation and minimum beacon rate formulation 

All the notations defined in subsections [Til] and IIV-AI are used throughout this section. In this subsection we define some 
additional notations for the analysis. 

Definition 8: Ni(t) is the set of nodes that belong to the neighborhood of node i. That is 

Ni(t) = {j : \Xi(t) - X 3 (t)\ <r,l<j<n,j^i} (58) 
Definition 9: Ni(t) is the set of nodes that the node i perceives to be its neighbors. 

The set Ni(t) is constructed by node i based on the beacons it receives. A node that belongs to iVj(t) may be excluded from 
Ni(t + t) if sufficient beacons are not received from the node during time interval [t,t + r]. Similarly, a node not belonging 
to Ni(t) may be included in iVj(f + r) if sufficient beacons are received from the node during time interval [t,t + t]. The 
deviation of Ni(t) from Ni(t) depends on the rate at which the nodes transmit beacons. 
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Definition 10: Zij(t) and Zij(t) (1 < i, j < n, i ^ j) are indicator random variables, defined in the following manner 

if j e NAt) 

Z ij {t)={ U (59) 

otherwise 

and 

fl, \ijeNAt) 

Z ij (t)=\ lW (60) 

I 0, otherwise 

In other words, Z^ (t) equals 1 if node j belongs to the neighborhood of node i at time t. Note that Zij (t) is a symmetric 
relation, i.e. Zij(t) = Zji(t). On the other hand, Zij(t) is 1 if node node i perceives node j to be its neighbor at time t. Unlike 
Zij(t), Zij(t) is not symmetric. That is Zij(t) may be although Zji(t) is 1. This may happen because beacon transmission 
rate of i is high enough to allow j to maintain consistent neighborhood set while beacon transmission rate of node j is not 
high enough to allow node i to maintain consistent neighborhood set. The deviation of the perceived neighborhood, Ni(t), 
from the actual neighborhood, Ni(t), is reflected by the deviation of Z^it) from Zij(t). 

Definition 11: The difference of Zij(t) and Zij(t) is defined as Eij(i), i.e. 

E ii {t) = Z ij {t)-Z ii (t) (61) 
Eij(t) = implies that node i has accurate information about whether j belongs to its neighborhood or not. It is not 
necessary that Eij(t) = for all t, however it is desirable that Eij(t) = with high probability at all time instances when 
node i has a packet to forward. This is because correct neighborhood information is highly critical for node i to make correct 
forwarding decisions. 

We can now state the minimum beacon rate problem in the following manner. 

Minimum beacon rate problem: What is the minimum rate at which node j must transmit beacons such that 

P[Eij(Ti(k)) = 0]>l-5yi<i^j<n,l<k<oo (62) 

In order to formulate the above minimum beacon rate problem as a rate distortion problem we present two more definitions. 
Definition 12: Let the vectors Zf- and Zf- be defined in the following manner 

Z% 4 {Z ij (T i (l)),Z ij (r i (2)),...,Z ij (r i (N))} (63) 

Z% 4 {Z ij (r i (l)),Z ij (r i (2)),...,Z ij (r i (N))} (64) 
Definition 13: Let V$ (8) denote the family of joint probability distribution function of Zf- and Zf- such that P[Eij (r^fc)) = 
0]<l-<5Vl<fc<7V. 

The superscript in V$(6) is used in order to distinguish the notation from the one used in the previous section. This 
superscript will be used for similar purpose in the rest of this section. 

Thus the minimum beacon rate, (S), may be expressed in the following manner 



?(*>) 

1L ' I U I 11111 111111 1 

where 



R^ b \6)= lim mmR^(5) (65) 



R<$(5)= min ±I PN (Z%;Z%) (66) 

and Ip N (Z-j-,Z-j) is the mutual information between Z-j and Zf-. In the next subsection we evaluate a lower bound on 

R^(S). 



B. Beacon Rate Analysis for One-Dimension Networks 

We first present the lower bound on the beacon rate of a node for the one-dimensional case and then extend the result to 
the two dimension case. 
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Lemma 3: The minimum beacon rate of node j, R^(8) is greater than equal to R^\d), that is 

R W (S) > Rf\s) (67) 

The proof of the above Lemma is similar to that of Lemma Q] 

Following the result of Lemma [3] we need to find a lower bound on R^\6) in order to bound R( b \6). We first find a 
lower bound on R±^ for the case when .Zy(O) = 1 and then find the bound for the case when ^j(O) = 0. The bound for the 
these two cases provide answer to the following question: (i) If i and j are neighbors at time t = 0, then at what rate must j 
transmit beacons such that node i knows whether j is neighbor of i or not when it has a packet to send at t = Tj(l); (ii) If 
i and j are not neighbors at t = 0, then at what rate must j transmit beacons such that i knows whether j is a neighbor of 
i or not when it has a packet to send at t = Till). Since the constraint P[£ , y(rj(l)) = 0] > 1 — 8 has to be satisfied for all 
1 < i < n, whether i is a neighbor of j or not at t = 0, the lower bound is maximum of beacon rate for the two cases. 

Lemma 4: If Zij(0) = 1, then K^{8) is bounded by 



where 



5 8 

i?i°V)> max H{Z ij {r i {l)))-H[-,l-8,-) (68) 

X j ( ) £ L i 



and 



n(^S, S -\A-Slo g r-) (1 .Ulu^l S ) ,69, 



Li = [Xi(0) — r, Xi(Q) + r] (70) 



i.e. Li is the set of possible positions of j at time t = such that 2^, (0) = 1. 
Proof: Recall that i?^ (5) is given by 

4 b) (<*) = inf m ^ (ri ) ; 4" (n)) (71) 

Now I Pl (Zij(Tx); Zij(T{j) is given by 

/Fi(Zy(Ti);£«(7-i)) = ff(Z ij ( n ))- J ff(Z ij (n)|4-(T 1 )) 

= H(Zij (ri)) - H(Z«(ri) - 4-(n)|4-(ri)) 
> H(Zij(Ti)) - H(Zij(ri) - Zij(n)) 
= HiZijin)) - HiEijin)) (75) 

We know that the probability distribution of Eij (t\ ) is given by 



(72) 
(73) 
(74) 




E ij (r 1 )) = {o, w.p. P2 (76) 



where p2 > 1 — 8, p% + P3 < 8 and p\ + P2 +P3 — 1 (since Pi £ Vx(6)). Under these constraints H{Eij{ri))) is maximized 
when p 2 = 1 - 8 and Pl = p 3 = 5/2, when H(E ij (T 1 )) = W (|, 1 - 8, §). Thus 



7 Pl (^(n); 4-(n)) > H{Z l3 { Tl )) -h(~,1-6, (77) 

Now H(Zij(T\)) depends on the position of node j at t = 0, ^j(0). We know that j € iVj(0) which implies that Xj(0) G 
Depending upon / r (t), H(Zij (ti)) is maximized for some X,(0) = x E Li. In order to ensure that P[.Ey (-rj(fc)) =0] < 1 — <5 
V i, the beacon rate must take care of this worst case. Thus 

/A(2«(n);2«(n))> max H(Z«(t»(1))) - W f ~ 1 - 5, f 
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^From the above equation and dTTT i we get d68l l. ■ 

We now outline how to evaluate H(Zij(Ti(l))) Without loss of generality we may assume that Xj(0) = 0. So if Zy(0) = 1, 
then Xj(0) — I where — r < I < r. ^From the point of reference of node i, node j performs Brownian motion with variance 
2a 2 . So 

p[^(n(D) = ipr j( o) = i e LiM i) = A = i er f (-^=L) + i er f (j=j (78) 

Now using the fact that the inter-arrival time of packets to be served by i has distribution f T (t), we get 



P[Z l3 ( Tl (l)) = 11^(0) =1^,]^ jT erf (jj=\ fr(t)dt + \ jT erf (^=) fr(f)dt (79) 

We know that E{Z it ;(t;(1))) = H(P[Z i3 ;(t;(1)) = 1]), where H(x) = -xlog(a;) - (1 - x) log(l - x) (0 < x < 1). We know 
that 7i(x) is maximum at x = 0.5, symmetric about x — 0.5 and is strictly increasing and decreasing in the interval [0,0.5) 
and (0.5, 1] respectively. 

Definition 14: Let I* be defined as 

r^arg min |P[Z«(ri(l)) = l|X,-(0) = 1} - 0.5| (80) 

— r<l<r 

In other words Z* is the value of Xj(Q) for which if(^jj(r,-(l))) is maximized. This leads us to the following Corollary. 
Corollary 1: The minimum beacon transmission rate of node j, denoted by i?( bl '((5), such that with probability at least 
1 — 5 the current neighbors of j know whether j belongs to their neighborhood at the time of forwarding next packet equals 

PS bi \5) >H{p(l*))-H h^-^Q beacons/msg (81) 

where p(l*) = (^(1)) = 1||X,(0) - X 4 (0)| = I*] and I* is given by ([80j. 

We now turn our attention to the second question i.e. at what rate should j transmit such that with probability at least 1 — 6 
all the nodes that are not neighbors of j at t = know whether j belongs to their neighborhood or not when they have a 
packet to forward? 

It is easy to see that Lemma [4] holds even if (0) = 0. We formally state a similar Lemma without proof for the case 
when Zij(0) = 0. 

Lemma 5: If Zij (0) = 0, then R j (6) is bounded by 

x ,(b) - (S - - <* 

where H (|, 1 — 5, 4) is given by d69l and L[ is the set of possible positions of node j at t = such that = 0, i.e., 

{z : | x - X;(0)| > r} (83) 



Rf>> max H^TiWJJ-H (82) 



Again, without loss of generality we assume that Xi(0) = 0. Let Xj(0) — I, such that \l\ >r, i.e. j does not belong to the 
neighborhood of i at time t = 0. P[^y(rj(l)) = l|X/(0) = Z, |Z| > r, Ti(l) = r'] is given by 

P[^(l)) = 1|X,(0) = I, |H > rMD = A = ^xf (^=) - ierf (-|=L) (84) 

Thus 

P[Z t] {n{l)) = 11^(0) = > r] = ^"erf (4=) Mt)dt - ± J°° eri (^=) /r(*)* (85) 

Note that that P[^y(ri(l)) = 1|X,(0) = i, |Z| > r] < 0.5 and its value is maximized for \l\ — r. Since Ti(x) is an increasing 
function of x in the interval [0, 0.5), the \l\ = r maximizes the value of H(Z tj (T t (l))) = H(P[Z ij (r i (l)) = 1|X,(0) = I, \l\ > 
r]). This leads to the following Corollary. 

Corollary 2: The minimum beacon transmission rate of node j, denoted by R^ b2 \S), such that with probability at least 
1 — 5 the nodes that are not current neighbors of j know whether j belongs to their neighborhood at time of forwarding the 
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next packet equals 

/'A ,i\ 

/msg (86) 



R (b2) (S) > H( P (r)) -h(^,1-S, beacons/i 



p(r)=J r eri ( ^= ) f T (t)dt 



where 

7?(Y = , ... 

To summarize, Corollary Q] provides a lower bound on the beacon transmission rate such that each of the current neighbors 
are able to maintain consistent neighborhood information with high probability. Corollary [2] provides a lower bound on the 
beacon transmission rate of a node such that all nodes that are not neighbor of the node know with high probability if the 
node joins their neighborhood. The minimum beacon transmission rate is therefore the maximum of the two rates given by 
dSB and (ED. 

However note that according to the definition of I*, Ti(p(l*)) > TL{p{r)). This implies that the lower bound on R^ b2 \8) in 
(l86l l is always less than the lower bound on R^ bl ^{5) in (18 It . Thus the lower bound on the minimum beacon transmission rate 
is given by the lower bound on R^ bl '{8) in ( 18 It . The following theorem formally states this discussion. 

Theorem 3: The lower bound on the minimum beacon transmission rate of a node such that the constraint in equation ( l62l 
is satisfied is given 

R^(S) > max(i?( bl )(<5),i? (b2) (<5)) (87) 



> H(p(l*)) -h(^,1- S,^) beacons/msg (88) 

where R^^S), i? (b2) (c5) and I* are given by (ED, (E3 and d80j respectively. 

When H(p(l*)) < H (§, 1 - 5, §), left hand side of dH will be meaningless. Thus (|88l may be expressed as 

R (b \5) > max ( H(p(l*)) -H ( -5, £ ] ,0 ] beacons/msg (89) 



2 2, 

The minimum overhead in bits/second, denoted by U^ b '(S), is given by 

U^(S)> JL m ^{n(p(l*))-H (i^-S,^) ,<>) bits/sec (90) 

where E[t] is the expected packet inter-arrival time and B is the size of beacon packet in bits. 

We now consider various packet inter-arrival distributions, f T (t) in order to obtain more insights into the results obtained 
in this section. 

1 ) Deterministic inter-arrival time : Consider deterministic inter-arrival time where at each node a packet to be forwarded 
arrives every T seconds. For this case f T (t) is given by 

f T (t) = S(t-T) 

So p(l) 4 P[Z«(7j(l)) = l|X,-(0) =!£i 4 ] is given by 

p(l) = P[Z l3 (n(l)) = l|X,-(0) = I S L t ] = 1 erf (;^=) - T)dt + ^ erf (^=) - r)dt 

= lferff4^Lf + erff4iL^ (91) 

In order to find i?( b )(<^ we need to find Z* as defined in ( |80l . Note that p(Z) is decreases monotonically as I increases form 
to r. Thus Z* may take the following values: (i) If p(r) > 0.5, then Z* = r; (ii) If p(0) < 0.5, then Z* = 0; and (iii) If 
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p(0) > 0.5 and p(r) < 0.5, then < I* < r and H(p(l*)) — 1. Thus we have the following relation 

fmax(w(|erf( 7 ^-))-W(|,l-J,f),o), if p(r) > 0.5 
i? (b) W > J m ax(w(crf(^2_)) _ W (g, i _ J, |) , ) , if p(0) < 0.5 (92) 
[max(l- ,0), if p{r) < 0.5 < p(0) 

The above relation may be expressed in a more compact manner in the following manner 

i^)(S)>max(w Q e rf(-^))w(p(r)-0.5)+w(erf(-^^ 

— 7Y 1 — (5, ^ , 0^ bcacons/msg (93) 

where U (x) is the unit step function, defined by 

f 1, x > 

«(a0 = < " (94) 

I 0, otherwise 

Let B represent the size of a beacon packet in bits, then the overhead in bits/second is given by 
UW {s) > | max Q erf (-^==) ) W(p(r) - 0.5) + U (erf (^7=ff ) ) W(0.5 - p(r)) 

+ 1 • W(0.5 - p{r))U{p{Q) - 0.5) - 7Y ( |, 1 - (5, | ] , ) bits/sec (95) 



2) Uniform inter-arrival time distribution : Now suppose that the packet inter-arrival time has a uniform distribution given 

by 

i < f < T 

fr(t) = { T ~ ~ (96) 
I 0, otherwise 

Thus p(l) 4 P[Z y -(r l (l))|X J (0) = 1 6 LJ is given by 

p(0 = P[Z,(^(1))1|^(0) =leL i} = ^f-f (^) * + erf (^=) A (97) 

Evaluating the first integral in the above equation we get 

-1 /"erf (±±) dt t^erf ( ^-L) + r ~ f exp f-ll^) + ±erf (j^U - ^ (98) 
Similarly the second integral yields 

-1 /"erf A - ^±^erf f^±M + -J±i_ exp ( J-L±ll) + I erf ( J^M _ £±£ (99) 

Thus is given by 

p(0 = ^ crf (tp?) + ^ crf (tp?) + exp (- w) + vS4 cxp (- w) 

+ 1 ferff^L) + erff4±L)V ! 4l («») 

Since erf (-7==^ + erf (-7==^ strictly decreases with Z, p(Z) is also a strictly decreasing function of Z. Thus, similar to the 
deterministic arrival case, Z* may take the following values: (i) If p(r) > 0.5, then I* = r; (ii) If p(0) < 0.5, then I* = 0; 
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and (iii) If p(0) > 0.5 and p(r) < 0.5, then < I* < r and H{p{l*)) = 1. Thus 



•ierf 



T 2 T 



r 2 



exp 



7 2 r 



v / ir<J 2 T 



exp 



(- r2 



-erf 
[0.5, 



2<r 2 T' 



if p(r) > 0.5 

if p(0) < 0.5 

if p{0) > 0.5 > p(r) 



(101) 



Substituting the above expression of p(l*) in the expression for the lower bound on the beacon update rate for the case of 
uniform inter-arrival time distribution may be calculated using {89). For uniform distribution of inter-arrival time, the lower 
bound on the overhead caused by the beacon updates is thus by 



U {b) (S) > ^ max (n(p(l*)) - H 1 - 5, ,0 ) l,ils/s<r 



(102) 



where B is the size of beacon packet and p(l*) is given by (1101b . 

3) Exponential inter-arrival distribution : Suppose that the inter-arrival packet time is exponentially distributed with mean 



— i.e. 

a 

Now for the exponential distribution p(l) is given by 

a 



frit) = 



p(0 



erf 



r + l 



e~xp(—at)dt 



(103) 



(104) 



Since p(l) is a decreasing function of /, p{l*) is given by 



p(r), if p(r) > 0.5 
p(0), ifp(0)<0.5 
0.5, if p(r)> 0.5 
and a lower bound on overhead caused by beacon updates is thus given by 

U {b \5) =aBmax(n(p(l*))-n(^,l-5,^) ,0 



(105) 



(106) 



It is not possible to find a closed form expression for the integrals in equation ( 1104t and numerical methods may be used 
to evaluate the lower bound on (5) for the exponential inter-arrival time distribution. 



C. Beacon Rate Analysis for Two-Dimensional Networks 

In this subsection we extend the minimum beacon rate analysis to two dimensional networks. For a arbitrary node pair i 
and j, we choose an orthogonal coordinate system such that Xn(0) = Xi2(0) = 0, X,i(0) = I, and Xj2(0) = 0. That is, 
the origin of the coordinate system corresponds to the position of node i at t = and the x-axis of the coordinate system 
corresponds to the line joining the position of nodes i and j at t = 0. It can be easily verified that a Brownian motion with 
variance a 2 can be decomposed into two independent Brownian motions with variance a 2 /2 along each axis. Also note that 
Lemmas [3] and |4] hold for the two dimensional case as well and may be proved in a similar manner. Thus the minimum beacon 
rate, R^(S), satisfies the following relationship 

R (b) (S) > H (Z«( ri (l))) -n(l,l-8, (107) 

Similar to the approach in the last section, we proceed by individually considering the cases Zij(0) = 1 and Zij(0) = 0. 
For the case when Zy(0) = 1, the probability that j is in the neighborhood of i when i has a packet to send (p(l)) is given 
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by 

p(0 4 P [Z y (r) = 11^(0) = I, X j2 (0) = 0, \l\ < r] = / P [X,-i(r<(l)) - <r|X,-i(0) = Z] 



P 



-\A 2 -x 2 < ^(^(l)) < Vr» - a;2|X j2 (0) = 



Relative to node i, node j performs Brownian motion with variance 2a . Thus 

P [X jx {T) = x\X sl (Q) = I] = -j=L= exp f-^=^ ) rfx 



and 



Therefore, 



P 



Vr 2 -i 2 < X i2 (r) < Vr- 2 -x 2 |X i2 (0) = 



= erf 



\/ r 2 — 



x~ 



V2^ 



T 



p(0 - / / -f== exp ( ) erf ( ) f T (t)dxdt (108) 



/o 7-r V 2a 2 < y ^ ^2aH 

Thus in order to satisfy ( |62l at all neighbors that are neighbor at time 0, node j must transmit beacon at a rate higher than 

d09) 

where is given by (l80l . 

Now consider the case when j does not belong to the neighborhood of i at t = 0. It can be easily verified that the probability 
that Zij(Ti(l)) = 1 given that Zij(Q) = (p'(l)) is given by the same expression as p(l) in equation (11081 l. p'(l) increases 
with decrease in \l\ and is maximized for \l\ = r. For other values of I > r, p'(l) < 0.5. Thus similar to the one-dimensional 
networks, the beacon transmission rate is determined by the rate required to satisfy (l62l at the initial neighbors. This leads the 
following theorem. 

Theorem 4: The lower bound on the minimum beacon transmission rate of a node such that the constraint in equation 
is satisfied is given 

beacons/pkt (110) 



R( b \S) > max (h (p(i*)) - H 1 - S, , 0^ 



where p(l) and I* are given by ( 11081 > and ( |80] > respectively. The beacon transmission overhead in bits per second, U^(S), is 
given by 

U^(6)> ^jL max 1 " Q -°) bits /sec (111) 

D. Comparison of Beacon Transmission Rates for Various Arrival Processes 

The closed form expression for the integral in J 108b cannot be found. So we use numerical computations to evaluate i?^ b )(5) 
for deterministic, uniform and exponential packet arrival processes. Figures [3] and [4] show plots of minimum beacon rate 
in bits per second. Figure [3] shows the plot of minimum beacon transmission rate against variance of Brownian motion for 
different mean packet inter-arrival times. It is observed that for low variance the rate is almost constant, while as the the 
variance increases the rate starts decreasing. When the variance of Brownian motion is very small, the variance of the change 
in position of a node within a packet arrival epoch is also small. For this case, I* = r and p(l*) ~ 0.5 which leads to high 
beacon rate. As the variance increases, the probability that two neighbors remain neighbors at the end of a packet arrival epoch 
is very small, no matter what the initial position of nodes might be. That is, when a 2 is high, p(l) < 0.5 V I, which leads 
to low beacon rate when variance is high. This is illustrated in Figure [5] Suppose the two nodes shown in the figure have 
communication range of two units. Initially nodes 1 and 2 are located at X = and X = 2 respectively. With respect to node 
1, the pdf of position of node 2 when the next packet is forwarded is shown for a 2 = 10 and a 2 = 80. It is observed that for 
a 2 = 10 the probability that nodes 2 will be in neighborhood of node 1 at the next packet arrival instant is approximately 0.5. 
This corresponds to large uncertainty which leads to high beacon update rate. On the other hand when a 2 = 80 the probability 
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Fig. 3. Beacons per second versus variance of Brownian motion. 




Fig. 4. Beacons per second versus the mean inter-arrival time of packets to be forwarded at a node. 



that nods 1 and 2 are neighbors at the next packet arrival instant is less than 0.5. It is therefore more likely that the two 
nodes will not be neighbors which reduces the uncertainty and hence leads to lower beacon update rate. The trend shown 
in Figure [3] implies that when nodes are highly mobile they need to transmit beacons less frequently and the membership 
of nodes in a neighborhood may be more efficiently deciphered by the absence of beacons. Figure [4] shows that as the rate 
of packet arrival increases, the beacon overhead may become prohibitively high. Also, for a given packet arrival rate, it is 
observed that the rate for a deterministic packet arrival process is smaller than that for exponential and uniform arrivals. This 
is because the probability that a node leaves the neighborhood of a certain neighbor within a packet inter-arrival duration is the 
highest (p(l) is close to 1) for deterministic arrival. For the uniform and exponential distributions the probability that packet 
inter-arrival time is less than the mean inter-arrival time is 0.5 and 0.63 respectively. Thus the probability that a node moves 
out of neighborhood during an inter-arrival duration is smaller than that for the deterministic arrival process. 
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Fig. 5. The pdf of position of node 2 with respect to node 1 given their initial positions. 
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Fig. 6. Capacity deficit versus number of nodes. 



VI. Capacity Deficit 

A wireless ad hoc network is said to transport one bit-meter when a bit is transmitted over a distance of one meter [5], The 
transport capacity of a network (in bit-meters per second) is defined as the supremum over the set of feasible rate vectors of 
the distance weighted sum of rates [27]. The transport capacity is expressed as XnL, where A is the average arrival rate at 
the nodes, n is the number of nodes and L is the average distance traveled by the bits. It is shown in [5] that the transport 
capacity of an arbitrary wireless network is 9 {W\fnA^j where n, W and A are the number of nodes deployed, transmission 
rate of the nodes and area over which the network is deployed respectively. It is shown in [5] that for a particular interference 
model known as the Protocol Model, the upper bound on the transport capacity of an arbitrary wireless network is given by 

— v/8 1 

XnL < — WVnA bit — meters/second (112) 

Let r) denote the expected distance between a node and its location server. Thus, on average, the location update information 
of a node travels at least r\ meters before reaching its location server. Thus the average overhead incurred by a node for 
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□ Deficit Due to Location Updates 

□ Remaining Transport Capacity 
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Fig. 7. Capacity deficit versus packet inter arrival time. 



updating its location information is at least i]U (e 2 ) bit-meters/second and the overhead incurred by location update information 
on the network equals at least nr]U(e 2 ) bit-meters/second, where U(e 2 ) is given by d47l i. A beacon transmitted by a node 
travels a distance equal to the communication radius. Thus the overhead incurred by the beacon packets on the network is at 
least nrU^{S), where U {b \S) is given by (II 1 lb . Thus the total transport capacity deficit due to the routing overhead is at 
least nrjU(e 2 ) + nrU^ b \S) bit-meters/second. This leads to the following theorem. 

Theorem 5: For the Protocol Model, the upper bound on the residual transport capacity available to an arbitrary network 
for transmitting data (XnL) is given by 

— /8 1 

XnL < — — WVnA-nnU(e 2 ) - nrU {b) {5) bit - meters/second (113) 

7T A 



Theorem [5] has interesting implications. The raw transport capacity of a wireless network scales as y/n while the overhead 
incurred by the routing overheads scales as n. Therefore if the number of nodes deployed in a network increases beyond a 
certain threshold, denoted by n*, then no useful information may be transported in the network and the whole capacity is used 
up by the geographic routing overheads. This leads to the following corollary. 

Corollary 3: For geographic routing, the upper bound on the maximum number of nodes that may be deployed in a network 
while ensuring that it has non-zero residual transport capacity is given by 

71 - [nU(e 2 ) + rU( b )(S) ) (U4) 

Proof: If the residual transport capacity is greater than zero then 

/8 i 

— -r wVnA- nr]U(e 2 ) - nrU (b) (S) > 

7T A 

which implies that 

V& i 



( V U(e 2 ) 



Rearranging the above equation yields (11 141 i. ■ 
Figures [6] |7] and ?? show the lower bound on the fraction of transport capacity that is used by the routing overheads. 
Figures [6] and [7] illustrate that for large network size and packet arrival rate, complete transport capacity of the network may 
be occupied by the routing overheads. 
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VII. Discussions 

In this section we briefly discuss some extensions of the rate-distortion formulation proposed in this paper. We first discuss 
the cases where our formulation and analysis yield trivial lower bound of zero. We then discuss how the formulation and 
analysis may be applied to other scenarios such as other location services, mobility models, etc. We then address some 
possible extensions to the model and analytical approach used in this paper. 

A. Trivial lower bound scenarios 

The lower bound on the rate at which a node needs to update its location server, derived in Section llVl may be zero in certain 
cases. This lower bound is trivial and does not provide much useful information. In this subsection we provide a technique 
for obtaining more useful bounds for such cases. For clarity, we first consider the location update rate for one-dimensional 
networks and then explain how similar technique may be applied to two-dimensional networks. We also comment on cases 
where the results of Section [V] yield trivial lower bounds on minimum beacon overhead rate. 

The lower bound on location overhead evaluated in IIV-BI is zero when /i(JQi(Ti)) < \ log (2iree 2 ). This occurs when 
the second moment of the change in position of a node during a packet inter-arrival period is less than the fidelity criterion 
e 2 . This may be the result of low mobility (small a 2 ) or average packet inter-arrival time (small S[i5]). Although average 
change in position of a node during a packet inter-arrival time may be small, the position change accumulates over time. Thus 
instead of updating the location server between every packet inter-arrival duration, the nodes may need to update the location 
information once in a few packet inter-arrival durations. Thus we modify the problem formulation for minimum location 
update overhead to take this into account. The distortion criterion considered in Section |IV] is that the expected squared error 
in location information of a node must be less than e 2 at every time instant a packet destined to the node is generated in the 
network. We relax the distortion measure such that the squared error in location information is required to be less than e 2 for 
every k th generated for the node. 

Let fg(t) denote the pdf of T k , the time at which k th packet destined to a node is generated in the network. T k is simply 
the sum of k packet inter-arrival durations, each of which is independently and identically distributed according to fs{t)- Thus 
/!(*) is g iven b Y 

pOO 

flit) = ft 1 (t)*f s (t)= ft\t-T)f s (T)dT (115) 

JO 

fs(t) = fs(t) (116) 
where * is the convolution operator. The pdf of Xn(Ti-), denoted by fx 1 ( x ) is given by 

f°° 1 2 

f h Xl (x)= __ e -^:/*( T )dT (117) 

Jo vZira^T 

and H(Xn(T k )) is given by 



Let k* be defined as 



H(X a (T k ))=- /£»log(/*,(x))dz (118) 

J — OC 

k*±mmke ^k\H (X a (T k )) > 1 log (27ree 2 ) J (119) 

In other words, k* is the minimum number of packet inter-arrival durations after which the second moment of the change in 
position of a node is greater than e 2 . The value of k* will depend on a 2 and the distribution of packet inter-arrival duration. 
So if we consider the relaxed distortion measure where we are only concerned with the squared error in location information 
after every k* packet generation instances, then following the same steps as in Subsection IIV-BI we get the following lower 
bounds 

R(e 2 ) > i (H{X tl (T k *))-^\og(2iree 2 )^j bits/packet (120) 
U(e 2 ) > I ^- ] ^H(X ll (T k ,))^^\og(2TTee 2 )^j bits/second (121) 



SUBMITTED TO IEEE TRANSACTIONS ON INFORMATION THEORY, JANUARY 2007 



24 



Note that the right hand side of the above constraints is always positive and hence are more meaning for the cases where 
results of Section [IV] yield trivial bounds. In fact the bounds evaluated in Section [IV] are special cases of ( 1120b and (112U 
corresponding to k* = 1. 

Similarly for two-dimensional network we can find more meaningful bounds by considering the change in location during 
k packet inter-arrival periods. An analogous k* may be defined as 

k* 4 mmfc e {k\H {X a (T k )) + H (X t2 (T k )) > log (yree 2 )} (122) 

For the relaxed distortion constraints we thus get the following bounds for the two-dimensional case 

R{e 2 ) > ^(H(X a {T k *)) + H(X i2 (T k *))-log(Tree 2 )) bits/packet (123) 

U(e 2 ) > 1 -^^^H{X. ll {T k ,)) + H(X l2 {T k ,))~\og(7:ee 2 )) bits/second (124) 

The lower bound on the minimum beacon overhead, evaluated in Section [V] is zero when H(p(l*)) < H (§, 1 - 5, §). This 
may happen if the er 2 i?[T] >> 2r. When a 2 E[r] >> 2r, with probability greater than 1 — S a node will leave the neighborhood 
of its neighbors within a packet inter-arrival period. According to our formulation the node does not need to send a beacon as 
the neighbors may simply assume that it has left their neighborhoods. As a result the bound may be loose in such circumstances 
because it is inevitable that the nodes will need to exchange beacons in order to maintain consistent neighborhood information. 
The idea of relaxing the distortion constraint by considering accuracy of neighborhood information over several packet arrival 
intervals, as done for the location update case, does not work in this case. This is because as we increase the number of packet 
arrival intervals the probability that two nodes remain neighbors over that time duration decreases. However if the fidelity 
criterion is very strict (5 « 0) then the bounds will be meaningful over a wide range of a and packet arrival rate distributions. 



B. Application of the formulation to other scenarios 

In this section we comment on generality of the rate-distortion model and how it may be easily extended to other scenarios. 

1 ) Per-Session Location Discovery: The network model used in this paper assumed that the location server of the destination 
is queried at the arrival of each new packet. Another implementation of geographic routing is possible where a source node 
queries the location server only on the arrival of a new session. During the session, the source and destination may piggy-back 
their location information along with the data and ACK packets respectively. Of course such a scheme would work well only if 
the session involves flow of traffic in reverse direction (destination to source) and the time elapsed between arrival of packets 
in the session is not large. 

The overhead incurred in updating the location server for such a geographic routing protocol would be different from the 
results presented in Section [IV] Also other than the overhead associated with updating the location server, another overhead 
associated with sending the location update information along with every ACK/data packet is also introduced. 

The rate-distortion analysis of Section [IV] may be easily extended to analyze the overhead incurred by geographic routing 
with per-session location discovery. Let Si denote the time interval elapsed between arrival of two sessions destined for the 
same destination and let S 2 denote the inter-packet arrival time within a session. Let fs\ (t) and f$ 2 (t) denote the pdfs of Si 
and S 2 respectively, sing similar approach as Theorem [TJ the overhead associated with updating the location server is greater 
than equal to 

h(Xn (Ti)) - i log (2nee 2 ) bits/session (125) 

where 

r°° 1 x 2 

f Xl (x)= -==e-^ f Sl (r)dr (126) 
J T =o V 2tto- z t 

Similarly the overhead incurred in sending the location information along with ACK/data packets is greater than equal to 

2 (h(f' Xi {x))~- log (27ree 2 ) J bits/packet (127) 
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where 

r°° i 2 

f Xl (x) = - 7 ==e-^f S2 (r)dr (128) 

J T =0 V 27TC T 

The factor of 2 appears in the above expression because both the source and destination need to send their location information 
piggybacked with each data and ACK packet. The overall overhead incurred in maintaining location information is given by 

U(e 2 ) > (h(f Xl (*)) ~ ilog(27r ee 2 )) + (h(f Xl (x)) - \ log (27ree 2 )) bits/second (129) 

The total transport capacity deficit is greater than equal to 

^(.fe(.T))-ilog(27ree 2 )^) + Jj^ (h(f' Xl (x)) ~ \ log (27re £ 2 )) + nrtfW (5)bit - meters/second (130) 

2) Other Location Services: The network model used in this paper accounts for a location service where the location of 
each destination is maintained by a single location server. Often location services use multiple location servers for maintaining 
location information of each destination. It should however be noted that the minimum rate at which a node must transmit its 
location information does not change and is given by (125V The multiple location servers affects the distance traveled by this 
location information generated by a node. 

Consider a location service that assigns k location servers for each node. Each of the k location servers is updated by a 
node using k independent messages. Let r\i be the average distance between a node and its i th location server. The transport 
capacity deficit caused by a such a scheme would be greater than equal to 

]rj^j nil (e 2 ) + nrll {b) (5) bit - meters/second (131) 

In several location service schemes independent update messages are not issued. Instead the update messages may be routed 
along a multicast tree so that all the location servers are updated. In such a case, the distance traveled by the location update 
messages, 77, would be equal to the length of the multicast tree. For example, consider XYLS (also known as column-row 
location service) [28]. In XYLS, each node maintains its location at every node that lies within the column containing the 
node. This is done by sending an update message in the north-south direction. Every node that overhears the messages updates 
the location information. Thus in a square field of area A meter 2 , a location update bit would travel a/A meters, i.e., r\ = \f~A. 
Thus the transport capacity deficit caused is greater than equal to 

\fAnU (e 2 ) + nrU {h) {5) bit - meters/second (132) 

3) Hierarchical Location Services and Distance Effects: In hierarchical location service schemes, there exists a hierarchy 
of location servers [29], [30]. The level of the server in the hierarchy depends on the distance from the node it serves. The 
location servers closest to a destination, and lowest in the hierarchy, have the most accurate information about the destination. 
As we move up the hierarchy, the location information becomes less accurate. That is farther a location server is from the 
destination it serves, less accurate would be the location information maintained by it. This scheme works well due to the 
distance effect - greater the distance from a node, slower it appears to move. 

Let €i be the expected error in location information available with the location servers at level i. Let 7^ be the average 
distance traveled by bits in order to update the servers at level i. Thus the transport capacity deficit for such a hierarchical 
location service scheme is greater than equal to 

k 

n^U (e?) 77, + nrU {b) {5) bit - meters/second (133) 

i=l 

4) Other Mobility Models: In this paper we use Brownian motion to model node mobility. Although Brownian motion may 
not be the most realistic mobility model, the existence of closed form expression for node location pdf makes analysis tractable. 
Also Brownian motion allows us to change the degree of uncertainty in node position by changing a single parameter a. 

Many other mobility models for ad hoc networks have been proposed [31], [32], some of which claim to be a better reflection 
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of reality than the other. Our analysis may be easily extended to any other mobility model, as long as the expression for steady 
state probability density function of the node locations is know. The pdf of node location may be plugged into d27b in order 
to obtain fx x {%)■ However it is a non-trivial exercise to evaluate the closed form expression of the node location for mobility 
models like random way-point model, Gauss-Markov model, etc. 

C. Extensions to the Model and Analysis 

In this subsection we discuss some of the possible extensions to the model and analysis that may be incorporated in the 
future. 

1 ) Incorporating the Caching of Location Information: The network model does not take into account the caching of location 
information at the source and intermediate nodes. For example, the source node may cache the location information of the 
destination received from the location server and may add the cached information to the headers of future packets. The cached 
information may be used until it expires after some fixed time. Also intermediate nodes with fresher information regarding 
the location of the destination may update the packet header. Such a caching scheme has not been incorporated in our model. 
Caching may save the overhead associated with periodically querying the location server, although the location update and 
beacon overheads may remain unchanged. 

2) Exploring Closed Form and Tighter Bounds: Although the network model used in this paper is very simple, still it 
is not possible to obtain closed form analytical expressions for many quantities. For example, a closed form expression of 
minimum overhead incurred by location update packets cannot be derived if the packets arrival process is Poisson. Similarly 
for the minimum beacon overhead case, all the results are expressed in terms of TL (p (I*)). However numerical methods may 
be easily applied to evaluate the values of the expressions for given parameters. Finding tight closed form bounds for the 
expressions would also be the focus of future work. 

VIII. Conclusion and Future Work 

In this paper we studied the protocol overhead incurred by geographic routing in order to route packets with a given level 
of reliability. The protocol overhead is categorized into location update overhead and beacon overhead. For both kind of 
overheads, the problem of finding minimum overhead incurred is formulated as rate-distortion problem. For location updates 
we evaluate a lower bound on the minimum rate at which a node must transmit its location information to the location server so 
that the expected error in location information used for routing is less than a given value. For the beacon updates, we evaluate 
a lower bound on the minimum rate at which a node must transmit beacon packets so that the probability that its neighbors 
maintain a correct neighborhood information is greater than a given value. We first evaluate the bounds for one-dimensional 
networks and then extend the results to two dimensional networks. We also characterize the deficit in capacity caused by the 
routing overheads. 

Developing an information theoretic framework for evaluating the minimum protocol overhead incurred by several classes of 
routing protocols, such as proactive, reactive, and hierarchical routing protocols, is the ultimate goal of the framework proposed 
here. Such a universal analytical framework for characterizing routing overheads for all routing paradigms would be useful 
in determining which protocol is suitable for a given scenario. Development of efficient routing protocols for mobile ad hoc 
networks, using the information theoretic results as a guideline, would also be a part of future research. 
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